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As an introduction to this paper on mathematical statistics the 
writer cannot refrain from quoting the first paragraph from the intro- 
duction of an extremely sound treatise on non-mathematical statis- 
tics, Zizek's Statistical Averages, a translation by Persons.f 

"In nearly all branches of statistical investigation averages or means are very 
often used for various purposes of the greatest significance. Averages are, therefore, 
undoubtedly to be reckoned among the most important aids in statistical method. 
Even Guerry, the founder of moral statistics, indicated the extraordinary importance 
of averages in the following definition of statistics: 'The science of statistics consists 
essentially in the methodological enumeration of variable elements, whose mean it 
determines.' Edgeworth defines statistics as 'the science of those means which are 
presented by social phenomena'; and similarly Bowley says, 'Statistics may rightly 
be called the science of averages.' The application of averages has, as is well known, 
given rise to controversies of various kinds, which fill a considerable part of statistical 
literature. Not infrequently the incorrect use of averages has also led to erroneous 
conclusions and to contradictions which have shaken our confidence in statistics. 
Averages, indeed, are only applicable under strictly defined conditions, and conclu- 
sions based on averages are likewise permissible only within well defined limits. It 
is the task of statistical science to investigate the application and use of averages 
from the general methodological standpoint, and to determine the part which aver- 
ages should play in statistical method." 

In astronomy and in other fields in which precise measurements are 
necessary, the average of a series of independent measurements of 
equal weight on an unknown quantity is assumed to give the best 
approximation to, i. e., the most probable value of, the magnitude in 
question. But the computation of the mean, instead of permitting a 
final answer to the problem, only gives rise to other questions of a 
more involved character. Thus, we may require the "probable error" 

* Sections III and IV of this paper will appear in the September issue of the Quarterly Publications. 
t New York: Henry Holt and Company, 1913. 
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of the mean in order to estimate the extent to which we may rely on 
this average, or if this mean is to be combined with other means, it is 
necessary to know the approximate "weight " of each in order to insure 
a proper combination of observations. 

In statistics it is likewise of the utmost importance that we be able 
to measure the dependability and the significance of averages. Thus, 
it may be quite necessary that we know that the average or normal 
weight of a male infant sixteen months old is 23.25 pounds, or that the 
average weight of a man who is thirty-seven years old and six feet tall 
is 179 pounds; but it is likewise important that we know to what 
extent these averages may be relied upon and be able to state to 
what extent a sixteen months old infant weighing 19 pounds or a 
thirty-seven year old male six feet in height and weighing 200 pounds, 
is abnormal. 

Again, if from past experience we observe that certain abnormal 
conditions of rainfall and temperature seem to be associated with 
abnormal crops, we immediately feel that it would be desirable to 
know to just what extent these phenomena are on the average asso- 
ciated, and to be able to determine, if possible, a functional relation 
between cause and effect. Assuming that a mathematical representa- 
tion of the law of cause and effect is obtainable, we must hesitate to 
place any confidence on any dependent prediction until we have investi- 
gated the dependability of the mathematical predictions. 

It is quite clear that the dependability of a mean must be a function 
of both the number of observations used in computing the mean and 
the magnitudes of the variations about the mean. It follows,therefore, 
that a critical investigation of the distribution of magnitude-deter- 
minations about their mean is essential. 

The fundamental problem of mathematical statistics is to develop 
an analytical expression that is capable of representing any distribution 
of magnitude-determinations, i. e., variates. The Gaussian or Normal 
Curve of Error, Pearson's Generalized Probability Curves, and Char- 
lier's Type A and Type B curves are all the results of investigations of 
this fundamental problem. 

In this paper we shall first investigate the general problem of dis- 
tribution from the point of view of a general mathematical representa- 
tion rather than from the point of view of the theory of probability. 
The writer believes it possible to justify this deviation from the usual 
method of presentation. 
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Section I 

DISTRIBUTIONS OF GRADUATED VARIATES 

Definition: Graduated variates are expressions of observed magnitudes 
which, from their nature, do not exist as integers but which may differ 
by infinitesimals. 

We assume that variations from the mean follow some unknown 
law and that the observed class frequencies are approximations to 
corresponding areas under a theoretical frequency function, which 
affords a mathematical representation of the unknown law of 
distribution. 

Let f(x) be any frequency function which vanishes at x = — k and 
x = +Z 2 , where the origin is taken at any point within the range. The 
limits may be either both finite, both infinite, or one finite and the 
other infinite. 

Let the range (— l x , k) be divided into segments of equal length, Ax, 
and let ordinates be erected at each of the points of division. Now 
let rectangles be inscribed on these segments with altitudes equal to 
their respective right-hand boundaries. 
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Choosing one of these points of division as origin, the area of the 
rectangle whose right-hand boundary is at x is 

Ax ■ f(x ) 

and that of any other rectangle whose right-hand boundary is c • Ax 
distant is 

Ax • /(a; +c-Ax). 

Let us now set up the function 
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h—xt 
Ax 

22 Aa; • f(x +c-Ax)e< n + c ' ^'^ 
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where i = * — 1 and to is a parameter, and observe that if we multiply- 
each term of this summation by 

and integrate the resulting series term by term with respect to co we 
obtain the following identity 
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■ du = 2wf (x ). 



This follows immediately from the fact that the general term of the 
summation after multiplication by 

g— x wi 

and subsequent integration yields, for c^O 

ir/Ax 

Ax • $ (xo+c-Ax) / (cosc-Aa; • w+i smc-Ax-w)do> = 0, 

—v/Ax 

since c is necessarily an integer. 
For c=0we have 

v/Ax 



Ax ■ f (x ) dco = 2irf (x ) . 



We have therefore 

w/Ax 

(1) /(*o) = ^/e-™ 

— w/Ax 



—w/Ax 

U—xt 
Ax 



2^ Ax ■ f (x +c-Ax)e<-*»+ c ' ^ 



Ij+Xtj 

Ax 



dco. 



Now permitting Ax to approach zero we have 

go 'a 

(2) / (a*) = Y v J e ~ nWi { f (^^^ \ d<a - 



Formula (2) is merely a form of Fourier's Integral. 
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If we now place 

h 
(3) Jj (x)^dx = e **«*-«f 1 1 +c 3 ^+c 4 ^- 4 +...}, 

the coefficients, c n , may be evaluated by expanding 

h 
f(x)e- a+mix ~ Cl)+c '2dx 

-h 

in a power series of oi and imposing the condition that the coefficients 
of co°, co 1 and co 2 of the resulting series must equal 1, 0, and respectively. 

For values of c„, (ra>_3) 

u 

and applying the operator formula 

<t>(D)e ax V=e ax <t>(D+a)V 
we may write 



it 
• = 6 -*I>« // ( a! )e" i( -» J+, ' , f d*] , 



(4) c^e"" ff (x)e^ x - Cl) j (D„+i s-cJVlf j ds] 

-.-.-/ { (x-cy-^ix-c^+^ix-cO"-* 
-h 

+ . . . >f(x)dx, 

where 

n (r) =n(n-l)(n-2) . . . (n-r+1). 

Imposing the condition that (4) must equal 1, 0, and for ra = 0, 1, 
and 2 respectively, we have 

1 

e h > 

ff(x)dx 
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xf (x)dx 



C\ 






Ci = 



h ' 

/ / (x)dx 

-h 

h 

/ (x — ci) 2 f(x)dx 
^h 

h 

I f (x)dx 

-h 

We see, then, that Ci is the distance from the arbitrary origin to the 
mean and that <% is therefore the second moment of unit frequency 
about the mean, or in other words is the square of the standard 
deviation. 

Representing, as is customary, by 

b, the distance from the arbitrary origin to the mean, 

c, the standard deviation, 

N, the total frequency of the distribution, 
v' n , the n th moment of unit frequency about the arbitrary origin, 

h 



i.e. *'==*■ 



h 



{x)dx 



h > 

Jj{x)dx 
-ll 

v n , the n th moment of unit frequency about the mean 

h 

(x-c 1 ) n f(x)dx 
i.e. v n = ^ Tt 

ff(x)dx 

-h 
we obtain, 

(5) »'„ = /„- n C 1 6/„_ 1 +„C 2 s 6, 1 „_ 2 - • • • 



726 American Statistical Association 

Referring to equations (2) and (3) we see that 
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If we now set 



-^-- I i +C3 ^ +C4 ^ + 



do). 



^ = 2^ / 



,-C^*-*^ 



and note that 
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2ir J 
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we have finally, dropping the subscript on x, that any ordinate 

(6) / (a; ) = iv{^-|4 3) +^ 4) -|^ 5) + • • • }, 



where from equation (5) 



(7) 



d = v 3 

Ci = Vi— 3c 4 

C& = V5 — 10<T 2 V 3 

c 6 = i'6-15(rV4+30o- 6 
c-i = vi — 21o- 2 f5+ Vbha^vz 
c 8 = n -28<rV 6 +210o-V, -315<r 8 
etc. 



From the well-known definite integral 
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we see that 

(8) 



4>x= — T= e 
°V2tt 



(x-b)* 
2a* 



Since the coefficients, c„, are functions of the unknown law of dis- 
tribution it would appear at first thought that we are now as far from 
having obtained a workable analytical expression for any distribution 
of graduate variates as we were at the start. 

But although fluctuations in the values of individual class frequen- 
cies may arise as the result of "random sampling," we assume that the 
values of v n as computed from any observed frequency distribution 
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will approximately equal the corresponding values which are functions 
of the unknown law of distribution. Certain theorems involving the 
theory of "probable error" enable us to ascertain the degree of 
approximation involved in this assumption. 

It follows, then, that if equation (6) is so rapidly convergent that 
terms after the third or fourth may be neglected, it affords us a simple 
representation of any distribution. 

The rapidity with which this series converges, however, depends 
upon the extent to which the generating function, <$> x , is a fair approxi- 
mation of the unknown law of distribution. In general, if the dis- 
tribution be approximately symmetrical, equation (6) is very rapidly 
convergent. If, however, the distribution be bimodal or extremely 
asymmetrical, series (6) as a function of the symmetrical generating 
function, <j> x , is for practical purposes of no value because 

(a) the number of required terms in (6) would necessitate an 
impracticable amount of labor; and 

(b) the probable errors of c„ for values of n greater than 5 or 6 are 
generally so large that the assumption that, for these, we may substitute 
the moments computed from the observed data for the theoretical 
moments, is open to serious criticism. 

It should be understood, however, that theoretically equation (6) is 
capable of representing any frequency function associated with grad- 
uated variates, but that convergence means one thing and rapid con- 
vergence quite another. 
In case the distribution is skew we may proceed as follows: 
Let 6 X be any arbitrarily chosen function which may be used as a 
statistical generating function and which, being a frequency function, 
may be expressed by means of equation (6) in the series 

(9) e x =M {^ *^+^> **>,+ . . . }, 

where d n is a function of the moments of 6 X . 
Eliminating <j> x between (6) and (9) we obtain 

(10) f(x) = ^ {fl.-^»+j^«-^.«»+ . . .} 
where 

A 3 = C3 — di 

At = Ci— di 
Ai = c&—di 
A<> = (& — <4) — 20d 3 (c3— <4) 
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The area under / (x) between any two limits may now be computed 
since 

b b 

(11) [f{z)d*-?L[f9jz-±9*> + ^9n- . . .]. 

Inasmuch as tables for both the definite integral from to x and 
D (n) (n = 1, 2, 3, 4, 5, 6) have been computed for 

1 (*-*>)' 
<t>x= — 7=e 2** 

it follows that in problems for which 4> x may be applicable, we are in 
a position to proceed with numerical computations. 

It is highly desirable, however, that investigations be made in the 
endeavor to establish a reference list of generating functions possessing 
various desired properties. If the indefinite integral of any of these 
generating functions could readily be computed, such an investigation 
would be of the greatest value. 

Again, since in the making of computations it would be much easier 
to obtain the numerical value of the n th difference of d x than for the 
n th derivative, it will be found of frequent advantage to utilize the fol- 
lowing relation 

d «Ja~* + a 1- . . 

V 2 3 
and write (10) and (11) as 

(12) f {x )=K !^ dx ~^A%+^A%- . . .} and 

b ~ ~~ b 

(13) ff{x)dx=jL\ 6 x dx-^-A%+^A%- . . .1 respectively, 

a a 

where 

B 3 = A Z B' 3 = A 3 

B^Ai+QA, B' 4 =Ai+4:A3 

5 6 = A t +10A t +Z5A, b , , = A s +^A 4l +j-A, 

B, = A,+15A b +85A,+225A 3 B'< i = A <i +12A 5 +-l r A i +100A 3 



In order that formulas 12 and 13 may be useful, it is necessary that 
the class interval be taken as the unit of x for B x and for all computed 
moments. 
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Section II 

DISTRIBUTIONS OF INTEGRAL VARIATES 

Definition : Integral variates are expressions of observed magnitudes 
which, from their nature, exist only as integers. For example: num- 
ber of petals of a flower, number of persons in a family, etc. 

We assume that variations from the mean follow some unknown law 
and that the observed class frequencies are approximations to corre- 
sponding ordinates of the theoretical frequency function which affords 
a mathematical representation of the unknown law of distribution. 

We may readily write down an expression from which we may develop 
the general law of distribution for integral variates by placing c equal 
to unity in formula (1) of the preceding section. We have, then, 



h j 



(i4) ^ x ^=hj e i^r 

Placing now 

V /(*)**"= *•+*•*-"£ {l +03^+04^+ . . . 

x=-h *• L L 

and proceeding exactly as before, we obtain, corresponding to (6), 
(15) / (*) -N { *,- j|*,»+^ w - • • • } 

where now the values of c n are the same as those given in (7) if we 
now define 



Z(g-cO n /(s) 
2/(s) 



and 
(16) 



71 

1 f aW 



This <£ x may be eliminated as before, producing expansions corre- 
sponding to (10) and (12) . 
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Discussion of Sections I and II 

In Uber das Fehlergesetz and Die zweite Form des Fehlergesetzes, 
Charlier has shown that on the hypothesis of elementary errors any 
law of error may assume one of the two forms. 

Type A 

<t> x +A 3 <j> x 3) +A i <t> x 4) + • • • 
TypeB 

where 



IT 






and 



-x " 
^ w = £_ / e »«»« cos (X sin w-Zw)dw 



o 
e A 



\x 

if a; be an integer. 

Owing to an oversight, v instead of oo appears as each limit of 4> x . The 
constants A n and B n are determined in Uber die Darstellung willkur- 
licher Funktionen by applications of the method of moments, but appear 
to differ from the ones developed herein for values of n greater than (5) 
since by our general formula (5), 

c 6 = »'6-15<r 2 j'4+30o- 6 , 
whereas Charlier gives 

c* = H — 15ff 2 j'4+ 15a- 6 . 

For values of n less than 6 the results agree. 

As the result of the present investigation it would appear that there 
is no essential difference between Charlier's Type A and Type B curves, 
aside from the fact that both are merely special cases of formulas (10) 
and (12) when we define our generating functions as 

1 (x-W 

6 X =* =e 2 a ' 

aV2ir 



and 



e~\ x 
d x = — | — respectively. 

\x 
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The fact that the second generating function may assume asymmet- 
rical as well as nearly symmetrical forms renders the Type B curves 
more powerful than those of Type A. 

It would appear, however, that for graduated variates the con- 
tinuous limit of Poisson's Exponential Binomial Limit, namely, Pear- 
son's Type III curve, 

y = yoe- yx (l + -Y a 



a 

placing i>z = v 3 , might be more logically applied. 

Again, if we do not impose on the above generating function the 
condition that Vi = v s , it will be noted that any law of distribution 
may be written, 

V 5) + 



I [4 \5 



where X is the above mentioned Pearson's Type III function. Again 
we might use 

J(x)=n! [ x +^A%-^A%+ • • • }• 

Ordinarily 

f(x)=N^e x +^A%j 



will give practically the same results as any one of Pearson's General- 
ized Probability Curves, and will involve far less labor. 



